VTC: Improving Video-Text Retrieval with User Comments

Abstract

Multi-modal retrieval is an important problem for many applications, such as recommendation and search. Current benchmarks and even datasets are often manually constructed and consist of mostly clean samples where all modalities are well-correlated with the content. Thus, current video-text retrieval literature largely focuses on video titles or audio transcripts, while ignoring user comments, since users often tend to discuss topics only vaguely related to the video. Despite the ubiquity of user comments online, there are currently no multi-modal representation learning datasets that include comments. In this paper, we a) introduce a new dataset of videos, titles and comments; b) present an attention-based mechanism that allows the model to learn from sometimes irrelevant data such as comments; c) show that by using comments, our method is able to learn better, more contextualised representations for image, video and audio. Project page: https://unitaryai.github.io/vtc-paper.

Similar resources

Examining User Interactions with Video Retrieval Systems

The Informedia group at Carnegie Mellon University has since 1994 been developing and evaluating surrogates, summary interfaces, and visualizations for accessing digital video collections containing thousands of documents, millions of shots, and terabytes of data. This paper reports on TRECVID 2005 and 2006 interactive search tasks conducted with the Informedia system by users having no knowled...

Improving Multimedia Retrieval with a Video OCR

We present a set of experiments with a video OCR system (VOCR) tailored for video information retrieval and establish its importance in multimedia search in general and for some specific queries in particular. The system, inspired by an existing work on text detection and recognition in images, has been developed using techniques involving detailed analysis of video frames producing candidate t...

Improving Text Summarization Using Noun Retrieval Techniques

Text Summarization and categorization have always been two of the most demanding information retrieval tasks. Deploying a generalized, multifunctional mechanism that produces good results for both of the aforementioned tasks seems to be a panacea for most of the text-based, information retrieval needs. In this paper, we present the keyword extraction techniques, exploring the effects that part ...

Improving the Automatic Retrieval of Text Documents

This paper reports on a statistical stemming algorithm based on link analysis. Considering that a word is formed by a prefix (stem) and a suffix, the key idea is that the interlinked prefixes and suffixes form a community of sub-strings. Thus, discovering these communities means searching for the best word splits that give the best word stems. The algorithm has been used in our participation in...

Improving Cross-Language Text Retrieval with Human Interactions

Can we expect people to be able to get information from texts in languages they cannot read? In this paper we review two relevant lines of research bearing on this question and will show how our results are being used in the design of a new Web interface for cross-language text retrieval. One line of research, “Interactive IR”, is concerned with the user interface issues for information retriev...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-19833-5_36